Visualize Token (vocabulary) Frequency Distribution Before Removing Stop Words

Visualize Token (vocabulary) Frequency Distribution After Removing Stop Words

Bigrams Frequency Distribution Before Removing Stop Word

Bigrams Frequency Distribution After Removing Stop Word

Trigrams Frequency Distribution Before Removing Stop Word

Hotel Description Length Distribution

The data was collected by myself, so there relatively clean, no extreme outliers.

Preprocessing hotel description text

The test is pretty clean, we don't have a lot to do, but just in case.

The following are recommended by Google for "Hilton Seattle Airport & Conference Center":

image.png

The following are recommended by Tripadvisor for "Hilton Seattle Airport & Conference Center":

image.png

Try a bed and breakfast

The following are recommended by Google for "The Bacon Mansion Bed and Breakfast":

image.png

Cool! Almost identical.

The following are recommended by Tripadvisor for "The Bacon Mansion Bed and Breakfast", which I was not impressed.

image.png